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q^L The Classical KAM-Theorem 

A. The purpose of this lecture is to describe the Kam theorem in its most 
• £^>asic form and to give a complete and detailed proof. This proof essentially follows 
^^he traditional lines laid out by the inventors of this theory, Kolmogorov, Arnold and 

£-jWoser (whence the acronym 'Kam'), and the emphasis is more on the underlying 
ideas than on the sharpness of the arguments. After all, Kam theory is not only a 
collection of specific theorems, but rather a methodology, a collection of ideas of 
how to approach certain problems in perturbation theory connected with 'small 
divisors'. 

B. The classical Kam theorem is concerned with the stability of motions 
in hamiltonian systems, that are small perturbations of integrable hamiltonian 
systems. These integrable systems are characterized by the existence of action angle 
coordinates such that the hamiltonian depends on the action variable alone - see 
[2, 14] for details. Thus we are going to consider hamiltonians of the form 

H(p,q) = h(p) + f e (p,q), fe(p,q) = e/*(p,<?,e) 

for small e, where p — (p%, . . . ,p n ) are the action variables varying over some domain 
D C R", while q = (q\, . . . , q n ) are the conjugate angular variables, whose domain 
is the usual n-torus T™ obtained from K n by identifying points whose components 
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differ by integer muitipies of 2ir. Thus, f e has period 2ir in each component of q. 
Moreover, all our hamiltonians are assumed to be real analytic in all arguments. 
The equations of motion are, as usual, 

p=-H q (p,q), q = H p (p,q) 

in standard vector notation, where the dot indicates differentiation with respect to 
the time t, and the subscripts indicate partial derivatives. The underlying phase 
space is D x T" C M" x T" with the standard symplectic structure 



The hamiltonian vector field Xh associated with the equations of motions then 
satisfies v(Xh, •) = —dH. 

We assume that the number n of degrees of freedom is at least 2, since one 
degree of freedom systems are always integrable. 

C. For e = the system is governed by the unperturbed, integrable hamilton- 
ian h, and the equations of motion reduce to 

p = 0, q = u) 



w = h p (p). 

They are easily integrated - hence the name integrable system - and their general 
solution is 

p(t)=Po, <l(t) = <7o + v(p )t. 

Hence, every solution curve is a straight line, which, due to the identification of the 
q-coordinates modulo 2tt, is winding around the invariant torus 



with constant frequencies, or winding numbers, u>(po) — (coi(po), ■ ■ ■ ,w n (po)). Such 
tori with linear flow are also called Kronecker tori. 

In addition these tori are Lagrangian. That is, the restriction of the symplectic 
form v to their tangent space vanishes, and their dimension is maximal with respect 
to this property. 




with 



7 P0 = { Po } x T 
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Thus, the whole phase space is foliated into an n-parameter family of invariant 
Lagrangian tori T PoJ on which the flow is linear with constant frequencies u)(po). - 
This is the geometric picture of an intcgrable hamiltonian system. 

It should be kept in mind that due to the introduction of action angle co- 
ordinates these solutions are related to 'real world solutions' by some coordinate 
transformation, which is periodic in q\, . . . , q n . Expanding such a transformation 
into Fourier series and inserting the linear solutions obtained above, the 'real world 
solutions' are represented by series of the form 

Z(p 0l q + cu(p Q )t)= J2 a fe (Po)e<^°> + << fe ^°», a k eR 2n , 
fceZ" 

where (•,•) denotes the usual scalar product. Thus, every solution is now quasi- 
periodic in t: its frequency spectrum in general does not consist of integer multiples 
of a single frequency - as is the case with periodic solutions -, but rather of integer 
combinations of a finite number of different frequencies. In essence, the 'real world 
solutions' are superpositions of n oscillations, each with its own frequency. Moreover, 
these quasi-periodic solutions occur in families, depending on the parameter q , 
which together fill an invariant embedded n-torus. 

Let us return to action angle coordinates. We observe that the topological 
nature of the flow on each Kronecker torus crucially depends on the arithmetical 
properties of its frequencies w. There are essentially two cases. 

1. - The frequencies u> are nonresonant, or rationally independent: 

(k,u}^0 for all O^eZ". 

Then, on this torus, each orbit is dense, the flow is ergodic, and the torus itself is 
minimal. 

2. - The frequencies uu are resonant, or rationally dependent: that is, there 
exist integer relations 

(k,u) = for some ^ k £ Z". 

The prototype is u = (ui, . . . , ui n - m , . . . , 0) , with 1 < m ^ n — 1 trailing zeroes 
and nonresonant (u>\, . . . , w„_ m ). In this case the torus decomposes into an m- 
parameter family of invariant n — m-tori. Each orbit is dense on such a lower 
dimensional torus, but not in T" . 

A special case arises when there exist m = n — 1 independent resonant 
relations. Then each frequency u>\, . . . ,u> n is an integer multiple of a fixed non-zero 
frequency uj* , and the whole torus is filled by periodic orbits with one and the same 
period 27r/w». 
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In an integrable system the frequencies on the tori may or may not vary with 
the torus, depending on the nature of the frequency map 

h p : D — > Q,, uj{p) = h p (p), 

where f2 C K™. We now make the assumption that this system is nondegenerate in 
the sense that 

duo 

det h vv = det — — 7^ 
dp 

on D. Then h p is an open map, even a local diffeomorphism between D and some 
open frequency domain £1 C K n , and 'the frequencies lu effectively depend on the 
amplitudes p\ as a physicist would say. It follows that nonresonant tori and resonant 
tori of all types each form dense subsets in phase space. Indeed, the resonant ones 
sit among the nonresonant ones like the rational numbers among the irrational 
numbers. 

This 'frequency-amplitude-modulation' is a genuinely nonlinear phenomenon. 
By contrast, in a linear system the frequencies are the same all over the phase space. 
As we will see, this is essential for the stability results of the Kam theory. As it is 
said, 'the nonlinearities have a stabilizing effect'. 

D. Now we consider the perturbed hamiltonian. The objective is to prove the 
persistence of invariant tori for small £ ^ 0. 

The first result in this direction goes back to Poincare and is of a negative nature. 
He observed that the resonant tori are in general destroyed by an arbitrarily small 
perturbation. In particular, out of a torus with an n— 1-parameter family of periodic 
orbits, usually only finitely many periodic orbits survive a perturbation, while the 
others disintegrate and give way to chaotic behavior. - So in a nondegenerate system 
a dense set of tori is usually destroyed. This, in particular, implies that a generic 
hamiltonian system is not integrable [6, 9, 28]. 

Incidentally, it would not help to drop the nondegeneracy assumption to avoid 
resonant tori. If h is too degenerate, the motion may even become ergodic on each 
energy surface, thus destroying all tori [7]. 

A dense set of tori being destroyed there seems to be no hope for other tori 
to survive. Indeed, until the fifties it was a common belief that arbitrarily small 
perturbations can turn an integrable system into an ergodic one on each energy 
surface. In the twenties there even appeared an - erroneous - proof of this 'ergodic 
hypothesis' by Fermi. 

But in 1954 Kolmogorov observed that the converse is true - the majority 
of tori survives. He proved the persistence of those Kronecker systems, whose 
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frequencies lu are not only nonresonant, but are strongly nonresonant in the sense 
that there exist constants a > and r > such that 

Oi 

(1) |(fc, w )|^ forallO^fceZ", 

where |fc| = |fci| + • • • + |fc n | . Such a condition is called a diophantine or small 
divisor condition. 

The existence of such frequencies is easy to see. Let A^ denote the set of all 
lo S K™ satisfying these infinitely many conditions with fixed a and r. Then A£, is 
the complement of the open dense set 

R a = U R a,k> 

where 

iT aifc = { W €R n :|(MI<«/l*r}- 

Obviously, for any bounded domain f2 C K™ , we have the Lebesgue measure estimate 
m(R T ak nn) = 0(a/\k\ T+1 ), and thus 

m(R T a n n) < £ m ( R «,k n fi) = 0(a) , 

provided that r > n — 1. Hence, i? T = P| Q >o ^ s a se ^ °f measure zero, and its 
complement 



A T = (J K 



ct>0 

is a set oi full measure in R", for any r > n — 1. In other words, almost every u> 
in R™ belongs to A T , r > n — 1, which is the set of all u) in R n satisfying (1) for 
some a > while t is fixed.. 

As an aside we remark that A r = for t < n — 1, because for every 
nonresonant w, 

(2) min |/fc, w )|<_M- 

by Dirichlet's pigeon hole argument. And for r = n — 1, the set A™^ 1 has measure 
zero, but Hausdorff dimension n - see [17] for references. So there are continuum 
many diophantine frequencies to the exponent n — 1, although they form a set of 
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measure zero. - But here we will fix r > n — 1 and drop it from the notation, letting 

E. But although almost all frequencies are strongly nonresonant for any fixed 
t > n — 1 , it is not true that almost all tori survive a given perturbation f £ , no 
matter how small e. The reason is that the parameter a in the nonresonancc 
condition limits the size of the perturbation through the condition 

e < a 2 . 

Conversely, under a given small perturbation of size e, only those Kronecker tori 
with frequencies lo in A a with 

a > \ft, 

do survive. Thus, we can not allow a to vary, but have to fix it in advance. 
To state the Kam theorem, we therefore single out the subsets 

£l a C SI, a > 0, 

whose frequencies belong to A Q and also have at least distance a to the boundary 
of f2. These, like A Q , are Cantor sets: they are closed, perfect and nowhere dense, 
hence of first Baire category. But they also have large Lebesgue measure: 

m(Q-ft a ) = 0(a), 

provided the boundary of f2 is piecewise smooth, or at least of dimension n — 1 so 
that the measure of a boundary layer of size a is 0(a). 

The main theorem of Kolmogorov, Arnold and Moser can now be stated as 
follows. 

The Classical Kam Theorem ([1, 8, 10]) Suppose the integrable hamil- 
tonian h is nondegenerate, the frequency map h p is a diffcomorphism D — > f2, and 
H — h + f e is real analytic on D x T" . Then there exists a constant S > such 
that for 

\e\ < Sa 2 

all Kronecker tori (T n ,tj) of the unperturbed system with lo e Q, a persist as 
Lagrangian tori, being only slightly deformed. Moreover, they depend in a Lipschitz 
continuous way on lo and fill the phase D x T™ up to a set of measure O (a) . 



Section 1: The Classical KAM-Theorem 7 



Here, 'real analytic on D x T n ' means that the analyticity extends to a uniform 
neighbourhood of D. 

Is is an immediate and important consequence of the Kam theorem that small 
perturbations of nondegenerate hamiltonians are not ergodic, as the Kronecker tori 
form an invariant set, which is neither of full nor of zero measure. Thus the ergodic 
hypothesis of the twenties was wrong. 

It has to be stated again, however, that this invariant set, although of large 
measure, is a Cantor set and thus has no interior points. It is therefore impossible to 
tell with finite precision whether a given initial position falls onto an invariant torus 
or into a gap between such tori. From a physical point of view the Kam theorem 
rather makes a probabilistic statement: with probability 1 — 0(a) a randomly chosen 
orbit lies on an invariant torus and is thus perpetually stable. 

F. We conclude with some remarks about the necessity of the assumptions of 
the Kam theorem. 

First, neither the perturbation nor the integrable hamiltonian need to be real 
analytic. It suffices that they are differentiable of class C l with 

I > 2t + 2 > 2n 

to prove the persistence of individual tori [12, 15, 21]. For their Lipschitz dependence 
some more regularity is required [16]. 

The nondegeneracy condition may also be relaxed. It is not necessary that 
the frequency map is open. Roughly speaking, it suffices that the intersection of its 
range with any hyperplane in R™ has measure zero. For example, if it happens that 

h p (p) = (wi(pi),...,w n (f>i)) 

is a function of p± only (and thus completely degenerate), it suffices to require that 



For a more general statement see [18], and [19, 23, 27] for proofs. 

Finally, the hamiltonian nature of the equations is almost indispensable. 
Analogous result are true for reversible systems [13, 16]. But in any event the 
system has to be conservative. Any kind of dissipation immediately destroys the 
Cantor family of tori, although isolated ones may persist as attractors. 
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2 The KAM Theorem with Parameters 

A. Instead of proving the classical Kam theorem directly, we are going to 
deduce it from another Kam theorem, which is concerned with perturbations of a 
family of linear hamiltonians. This is accomplished by introducing the frequencies 
of the Kronecker tori as independent parameters. This approach was first taken 
in [11]. 

To this end we write p = po + I and expand h around po so that 

h(p) = h{ Po ) + (h p {p Q ),r)+ f {i-t)(h pp ( Pt )i,i)dt, 

Jo 

where p t = po + tl. By assumption, the frequency map is a diffeomorphism 

h p : D — > O, po i-> u — h p (p ). 

Hence, instead of p 6 D we may introduce the frequencies u £ ft as independent 
parameters, determining p$ uniquely. - Incidentally, the inverse map is given as 

: D. -> D, uj p = g u (w), 

where g is the Legendre transform of h, defined by g(uS) = sup J9 ((p,a;) — h(p)). See 
[2] for more details on Legendre transforms. 
Thus we write 

h{p) = e(u) + <w,7) + P h {I,u) 

with 

P h = [ {l-t)(h pp (p t )I,I)dt, 
Jo 

and 

fe(p,q) = Pe(I,q,oj) = ef*(po + I,q,e). 

Writing also 9 instead of q for the angular variables, we obtain the family of 
hamiltonians H — N + P with 

N = e(u) + (u,I), P = P h (I,w) + P e (I,6,w). 

They are real analytic in the coordinates (I, 0) in BxT", B some sufficiently 
small ball around the origin in R™ , as well as in the parameters to in some uniform 
neighbourhood of fL 
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This family is our new starting point. For P = it reduces to the normal 
form N = e(u>) + There is an invariant Kronecker torus 

7 U = {0} x T n 

with constant vector field 

for each uj g f2, and all these tori are given by the family 

$ : FxSl^BxT", (6, w) >-> (0, 6) 

of trivial embeddings of T™ over Q, into phase space. Moreover, each such torus 
clearly is Lagrangian. Our aim is to prove the persistence of a 'subfamily' of such 
Lagrangian Kronecker tori under sufficiently small perturbations P ^ over the 
Cantor set fl a C of frequency parameters uj. - Thus, instead of proving the 
existence of a Cantor family of invariant tori in one hamiltonian system, we first 
prove the existence of one invariant torus within a Cantor family of hamiltonian 
systems. 

This change of perspective has several advantages. - The unperturbed 
hamiltonian is as simple as possible, namely linear. This simplifies the Kam proof. 
- The frequencies are separated from the actions. This makes their role more 
transparent. For example, the Lipschitz dependence of the tori on lj is easily 
established. - Generalizations such as weaker nondegeneracy conditions and 
extension to infinite dimensional systems are easier. Also, this approach lends itself 
to applications in bifurcation theory, where systems naturally depend on parameters. 

B. To state the basic result quantitatively we need to introduce a few notations. 

Let 

D r<s = {I : \I\ <r}x{6: \Q6\ < s} C C" x C" 

and 

O h = {uj : \w-Cl a \ <h}c C" 

denote complex neighbourhoods of the torus {0} x T n and f2 a , respectively, where 
| • | stands for the sup-norm of real vectors. The sup-norm of functions on D r s x Oh 
is denoted by \-\ rtS h - 
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We will also consider the Lipschitz constants of mappings with respect to to . 
We define 

M, = sup l^>-*HI , 

v^u \V — (J\ 

where the underlyingdomain will be clear from the context or indicated by a 
subscript. 

Theorem A Let H = N + P. Suppose P is real analytic on D r>s x Oh with 
\ P \r,s,h ^ jars", as" ^ h, 

where v = r + 1 and 7 is a small constant depending only on n and r . Suppose also 
that r,s,h $J 1. Then there exists a Lipschitz continuous map tp : £l a — » f2 close to 
the identity and a Lipschitz continuous family of real analytic torus embcddings 
$ : T n x fl a — > £? x T" dose to <& , such that for each uj £fl a the embedded tori 
are Lagrangian and 

X H \ , n o $ = D$-X N . 

Moreover, $ is reai analytic on T* — {0 : \Q0\ < s/2} for each lj, and 
\W($-$ )\,as u \W($-$ Q )\ c <-SL-\P\ rtaih , 
|^-id|,as"| ¥) -id| jC < l\P\ r . h , 



uniformly on x f2 Q and f2 Q , respectively, where c is a constant depending only 
on n and t, and W = diag(r _1 Id, s _1 Id). 

By slight abuse of notation we wrote Xm also for the analogous constant vector 
field on T™ alone. - The theorem states that for each uj in £l a there is an embedded 
invariant Kronecker torus T w = &(T n ,uj) with frequencies u) for the hamiltonian 
vector field Xh at the slightly shifted parameter value ui — (p(u>) . Conversely, for 
each cu in the slightly deformed Cantor set 

(l a = <p(Cl a ) c £1, 

the vector field Xh-|_ admits an invariant Kronecker torus T w with frequencies u — 
ip^ 1 ^). Each such torus is Lagrangian and close to the corresponding unperturbed 
torus. 
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C. The Lipschitz estimates allow us to control the measure of fl a and its 
complement. To this end we first extend ip to a lipeomorphism tp of fL 

Proposition 1 The map ip can be extended to a lipeomorphism cp : D. — > fi 

with 

l^- id k^ < max (l^- id L,n Q l^- id ln Q ) ■ 

Proof. Let "0 be any coordinate function of ip — id defined on f2 Q . Define ip 
on fi c = R" - to be zero. Then 

Mc^uf}" s$ max^-id| £ na |c^> - id| n J < 1. 

According to Appendix B we can extend ip to a function ip on all of IR™ preserving 
its Lipschitz constant . Doing this with every coordinate of ip we obtain an extension 
<p of tp such that <p = id on f2 c and 

l^- id L, R " = I^L,n a un« < L 

Hence <p is a lipeomorphism on K™. Since it is the identity outside of O, it is also a 
lipeomorphism of f2, extending ip. [°] 

Proposition 2 For Cl a = (p(Q a ) the estimate 
m{fl-n a ) = 0(a) 
holds, where the implicit constant depends only on CI . 

Proof. Let (p be the extension of Proposition 1. Then 

m(Cl — Cl a ) = m(Q — ip(Cl a )) 
= m(Q — ip(Cl a )) 
= m((p(Cl — Cl a )) 

= 0(a). @ 

The Cantor family of torus embeddings may also be extended to a family 
over Q. This extension can even be chosen so that the additional embedded tori are 
still Lagrangian, though of course not invariant. 
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Proposition 3 There exists an extension of <& to a Lipschitz continuous 
family of real analytic torus embcddings 

!> : T" x -> B x T" 

such that each embedded torus is Lagrangian, and the estimates for <I> are the same 
as for though with a different constant c. 

D. We thus arrive at the following conclusion. 

Theorem B Suppose the assumptions of Theorem A are satisfied. Then 
there exist a lipcomorphism p : — > f2 close to the identity and a family of torus 
embcddings 

!> : TP x Q R" x T" 

close to <&o such that for every parameter value 

Q £ (l a = <P{^a) 

the hamiltonian vector field Xh\^ admits an invariant Lagrangian Kronecker torus 
T w = i>(T",u;), where u> = p^ 1 ^). Moreover, the estimates for (p and 3> are the 
same as for tp and in Theorem A, though with a different constant c, and 

m{Q-Q a ) = 0(a), 

where the implicit constant depends only on O . 

We will see at the end of section 5 that the map tp actually can be assigned 
w-derivatives of every order on the Cantor set f2 Q . This may be formalized by 
introducing the intrinsically defined notion of a differentiable function on an arbitrary 
closed set [25, 26]. The point is that - due to the Whitney extension theorem - 
such functions can be extended to functions on the whole space with the same 
differentiability properties. The upshot is that there even exists an extension of 
ip to a C°° -function <p on il. The same applies to $ and leads to the notion of 
smooth foliations of invariant tori over Cantor sets [16]. 

D. We now prove the classical Kam theorem. Introducing the frequencies as 
parameters we wrote the hamiltonian as H = N + P , where 

P = Ph + Pc 

is real analytic on B x T" x f2 , B some small ball around the origin in E™ . Thus we 
can fix some small h and s, with s v h, so that P is real analytic on the complex 



Section 2: The KAM Theorem with Parameters 13 



domain D TtS x Oh for all small r, and so that 

\P\r,e,h «S \ P hksM + \PfX,.,h < Mr 2 + Fe, 

where M is a bound on the hessian of h and F — sup p g, e) | - 

To meet the smallness condition of Theorems A and B, we choose r by requiring 
that Mr 2 = Fe and arrive at the condition 



2Fe < 7«rs 1/ = ^as v \ 



I Ft 



or 



So there is a 5 depending on n, t and H such that Theorems A and B apply for 
e ^ 5a 2 . 

By construction, an orbit (I(t),6(t)) for the hamiltonian H at the parameter 
value Cj translates into an orbit (p(t),q(t)) = (po(u)+I(t),6(t)) for this hamiltonian 
in p, ^-coordinates. It therefore follows with Theorem B that the mapping 

which is a composition of <l and <p with 

BxFxO^DxT", 

(I,6,u ) )~(h-\w) + I,6), 

is, for every u> £ Q a , an embedding of an invariant Lagrangian Kronecker torus 
(T n ,ui). Moreover, if? is Lipschitz close to the real analytic unperturbed embedding 

* : T" x ft -> D x T\ 

It follows that the measure of the complement of all those tori in the phase space is 
bounded by a constant times the measure of il a x T™, hence is 0(a). This finishes 
the proof of the classical Kam theorem. 
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3 Outline of the Proof of Theorem A 

A. We prove Theorem A by a rapidly converging iteration procedure that was 
proposed by Kolmogorov [8] . At each step of this scheme a hamiltonian 

Hj = Nj + Pj 

is considered, which is a small perturbation of Nj = ej + (w,J). A coordinate and 
parameter transformation 3^- is constructed such that 

Hj o Jj- = N j+1 + P j+1 

with another normal form iVj+i and a much smaller error term Pj+i- Namely, 

\P j+1 \<C\Pj\ K 

with some exponent k > 1 . Repetition of this process leads to a sequence of 
transformations 3^0,3^, . . . , whose products 

P ' = Jo o 3 r i o . . . o Jj-i 

converge to an embedding of an invariant Kronecker torus. 

In the meantime a number of other proofs have been given, for example by 
formulating some generalized implicit function theorem suited for small divisor 
problems [29], or by referring to an implicit function theorem in tame Frechet 
spaces [4]. Recently, Salamon and Zehnder [22] gave a proof that avoids coordinate 
transformations altogether and works in configuration space. Also, Eliasson [5] 
described a way of using power series expansions and majorant techniques in a very 
tricky way. 

But here we stick to the traditional method of proof, as it probably is the most 
transparent way to get to know the basic techniques. They are indeed quite flexible 
and robust, and not at all restricted to perturbations of integrable hamiltonian 
systems. As we mentioned in the beginning, these techniques rather amount to a 
strategy of how to approach a large class of perturbation problems. 

B. To describe one cycle of this iterative scheme in more detail we now drop 
the subscript j. 

First, the perturbation P is approximated by some hamiltonian R by truncat- 
ing its Taylor series in / at first order and its Fourier series in 9 at some suitable high 
order K . The approximation error P — R will be small, and we now consider the 
hamiltonian H = N + R instead of H = N + P. The purpose of this approximation 
will become clear later. 
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The transformation J consists of a parameter dependent symplectic change of 
coordinates $ and a change if of the parameters alone: 

F= (*,¥>): (1, 0,w) .-(*(/, 0,w) )¥ >(a;)). 

Moreover, this coordinate transformation is of the form 

(7,0,u;)~(C/(J,0, W ),y(0,u;)), 

where [7 is affine in I , and V is independent of I. Such transformations £F form a 
group S under composition. 

We obtain <I> as the time-l-map of the flow X F of a hamiltonian F, which is 
affine in I. That is, <I> is the time-l-shift determined by the equations of motion 

I = -F e (I,6,w), 6 = Fr(e,u;). 

Then <I> is symplectic for each w. 

To describe the transformed hamiltonian H o<J> we recall that for a function K, 

^KoX* F = {K,F}oX t F , 
the Poisson bracket of K and F evaluated at X F . Indeed, 

^-KoXt = V K e 6 j + K I .i j 
dt F t=o ^ 3 3 3 3 

= E Ke^-K^Fe^iK^}, 

and the general formula follows. 

So we can use Taylor's formula to expand H o $ = Ho X F \ with respect 
to t at and write 

iJo$ = NoX F \ t=1 + RoX F \ t=1 

= N+{N,F}+ [ {l-t){{N,F} ,F}oX F dt 
Jo 

+ R+ f {R,F}oX F dt 
Jo 

= N+{N,F} + R+ f {{l-t){N,F} + R,F}oX F dt. 
Jo 
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This is a linear expression in R and F - the linearization of Ho<5> - plus a quadratic 
integral remainder. That is, if R and F are both roughly of order e, then the integral 
will roughly be of order e 2 <C e and may be ascribed to the next perturbation P + . 
The point is to find F such that 

N + {N,F} + R= N + 

is again a normal form. Equivalently, we want to solve 

(3) {F,N} + N = R, N = N + -N 

for F and AT when R is given. Suppose for a moment that such a solution exists. 
Then (1 - t) {N,F} + R = (1 - t)JV + tR, and altogether we obtain 

Ho<S> = Ho<f>+(P-R)o<I> = N + + P + 

with iV + = AT + N and 

(4) P+= f {(l-t)N + tR,F}oX t F dt+(P-R)oX$, 

Jo 

as the new error term. 

C. Let us consider equation (3) first on a formal level. Clearly, 

d u F :={F,N}= ]T F ej N I} = £ ^F e . 

is a first order partial differential operator on the torus T" with constant coeffi- 
cients ijj. Expanding F into a Fourier series, 

f= j2 F ^ i(k ' d ^ 

with coefficients depending on / and u>, we find 

Thus, admits a basis of eigenfunctions e 1 ^"' 6 ^ with eigenvalues i{k,u>), k 6 Z'". 
In other words, diagonalizes with respect to this basis. 

If lu is now nonresonant, then these eigenvalues are all different from zero 
except when k = 0. We then can solve for all Fourier coefficients Rk of the given 
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function R, except for Rq, which is given by the mean value of R over T", 



Hence, if R is given, then we can always solve the equation d u F — R— [R] at least 
formally by setting 



We are still free to add a ^-independent function to F, but we choose to normalize 
F so that [F] — 0. - Finally, equation (3) is completely solved by setting 



Of course, this choice of N is in no way uniquely determined, but this is in some 
sense the simplest one. 

D. There is a more systematic interpretation of the preceding construction. For 
irrational uj, the domain of d u , consisting of all formal Fourier series in 9 (ignoring 
the other coordinates here), splits into two invariant subspaces, its nullspace N 
consisting of all constant functions, and its range ft, consisting of all series with 
vanishing constant term. Moreover, d& is invertible on ft. 

Decompose R into its respective components in N and ft, 

R = Rn + Rji- 

The projection onto IN" is given by taking the mean value, so 

Rjsi = [R] , R% — R — [R]. 
The equation 

d u F + N = R = Roi + Rk 
is then solved by 'solving componentwise', 

N = i?3\r = [R], d u F = R^ = R- [R], 

where the latter can be solved uniquely for F in ft, since is invertible on ft. 

This general procedure - 'solve for all the terms you can solve for, and keep 
the rest' - is at the basis of all normal form theory. It just happens to take a 
particularly simple form in our case. 




(5) 




N=[R]. 
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E. So far our considerations were formal. But in estimating the series repre- 
sentation (5) of F , we are confronted with the well known and notorious problem 
of 'small divisors'. Even if lu is nonresonant, infinitely many of the divisors (k,Lu) 
become arbitrarily small in view of (2) , threatening to make the series (5) divergent. 

This divergence is avoided, if lu is required to be strongly nonresonant. To 
formulate this key lemma, let A s denote the space of all analytic functions u defined 
in the complex strip {9 : sup^ \5s0j\ < s} C C" with bounded sup-norm |u| s over 
that strip. Let 

A s = {u e A s : [u] = 0}, 

and recall that lu £ satisfies 

\(k,uj)\ > a/\k\ T for all ^ k € Z". 

Lemma 1 Suppose that to e . Then the equation 

d u u = v, v e Aq, 
has a unique solution u in Uo<c<s -^-o ~° ■> w tth 

\ u \s-v^^\ v \ s i 0<a<.s, 
where the constant c depends only on n and r. 



Proof. We prove the lemma with o T+n in place of o T to avoid lengthy 
technicalities. The interested reader is referred to [17] or [15] for a proof of the 
sharper result. - Expanding u and v into Fourier series, the unique formal solution 
u with [u] = is 



Uk.L 



Vk c i(k,6) 



As to the estimate we recall that the Fourier coefficients of an analytic function on 
T" decay exponentially fast: 



\v k \ < \vLe 



-\h\s 



where |fc| = \k\ \ + • • • + |fc„| . See Lemma A.l for a reminder. Together with the 
small divisor estimate for to we obtain 
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e \k\(s-a) ^ 




-\k\a 



\{k,L>)\ 



a 



fe#o 



The infinite sum is now easily estimated by constant times a T " . 

We observe that 9" 1 is unbounded as an operator in Aq. It is bounded only as 
an operator from Aq into the larger spaces Aq~ ct , with its bound tending to infinity 
as a tends to zero. This phenomenon is known as 'loss of smoothness' affected by 
the solution operator d~ l , and is the main culprit why small divisor problems are 
technically so involved. During the iteration we have to let a — ► in order to stay 
in the classes Aq. But then the operator d^ 1 is getting unbounded. By the rapid 
convergence of the Newton scheme, however, the error term converges to zero even 
faster, thus allowing to overcome this effect of the small divisors. 

It is absolutely essential for Lemma 1 to be true that uo satisfies infinitely 
many small divisor conditions, thus restricting to to a Cantor set with no interior 
points. On the other hand, we will also need to transform the frequencies and thus 
want them to live in open domains. This conflict is resolved by approximating P by 
a trigonometric polynomial R. Then only finitely many Fourier coefficients need to 
be considered at each step, and only finitely many small divisor conditions need to 
be required, which are easily satisfied on some open w-domain. Of course, during 
the iteration more and more conditions have to be satisfied, and in the end these 
domains will shrink to some Cantor set. 

F. We still have to finish one cycle of the iteration. Solving (3) we arrive at 
H o $ = N + + P + , where 



since [R] is affine in / and independent of 9. To write N + again in normal form, 
we have to introduce 



as new frequencies. Since v is small, there exists an inverse map <p : uj + i— > lo by 
the implicit function theorem - see Appendix A. With this change of parameters, 



N + = N + [R] = e+(cj) + (uj + v(oj) ,1) , 



LO + = LO + v(io) 



+ = e +(^+) + ( w +^> 



is again in normal form. This finishes one cycle of the iteration. 
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4 The Kam Step 

A. To avoid a flood of constants we will write 

u < v, u ■< v, 

if there exists a positive constant c 1 , which depends only on n and r and could 
be made explicit, such that u ^ cv and cu ^ v, respectively. - Now let P be a real 
analytic perturbation of some normal form N . 

The Kam Step Suppose that \P\ r s h < e with 
(a) e •< aijrcr 1 ', (b) e ■< hr, (c) < 

for some < ry < 1/8, < ct < s/5 and K ^ 1, where 1/ = r + 1 . Then there exists 
a real analytic transformation 

J = ($, : D nryS _ ha x 0, l/4 -> D r , s x O h 

in the group S such that H o J = N + + P + with 

\P + \^s- 5a j l/4 <^ + (v 2 + K n e-^)e. 

Moreover, on Z^r.s-Sa x Oh and Oh/ 4 the estimates 

\W(^> - id) I , |W(£>$ - Id)^ 1 ! < — — , 

b-idl ,/i|£V-Id| < -, 
r 

hold with the weight matrix W = diag(r _1 Id, cr _1 Id). 

B. The proof of the Kam Step follows the lines of the preceding section and 
consists of six small steps. Except for the last step everything is uniform in Oh, 
whence we write |-| rs for |-| rs?l throughout. 

1. Truncation. We approximate P by a hamiltonian R, which is affine in I 
and a trigonometric polynomial in 9. To this end, let Q be the linearization of P 
in / at I = Q. By Taylor's formula with remainder and Cauchy's estimate - see 
Appendix A for a reminder -, we have 



\Q\r,.<e> \P-Q\2 v r, s <V 2 e. 



Section 4: The Kam Step 21 



Then we simply truncate the Fourier series of Q at order K to obtain R. By 
Lemma A. 2, 



\R-Q\ rs _ a< K n e-"°e. 
Since the factor K n e~ Ka will be made small later on, we also have 

\K,s-* < e - 

See Appendix A for some remarks about this truncation of Fourier series. 

2. Extending the small divisor estimate. The nonresonance conditions (1) are 
assumed to hold on the real set fi a only. But assumption (c) implies that 

cy 

(6) \(k,v)\>wrr>T for all ± < K 
2\k\ 

for all co in the neighbourhood Oh of fl a . Indeed, for every uj £ Oh there is some 
o>* G fl a with \u — w*| < h, hence 

\{k,u - < \k\ \u - u*\ H Kh ^ — < — —r 

for |fc| ^ if. Together with the estimate (1) for this proves the claim. 

3. Solving the linearized equation {F, N} + N = R. We solve this equation as 
described in the preceding section. We have N = [R] and thus 

W\ r < \R\ r , s -„ < e. 

We can solve for F uniformly for all u in Oh because of (6) and the fact that R 
contains only Fourier coefficients up to order K , by truncation. Hence the estimate 
of Lemma 1 applies as well, and we obtain a real analytic function F with 

I PI tr l^kizg € 6 

With Cauchy we get \Fg\ r s _ 3cr < e/aa T+1 and |-Fj| r / 2 s-ia < e l OLVIjT , hence 

1 m , 1 \f z \ < 

uniformly on -D r /2. s -3cr with = r + 1 . 
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4- Transforming the coordinates. The coordinate transformation <I> is obtained 
as the real analytic time-l-map of the flow X F of the hamiltonian vector field Xp, 
with equations of motions 

i = -F e , e = Fj. 

With assumption (a) and the preceding estimates we can assure that we have 
\Fg\ W ^ r /8 and \Fj\ ^ a on -D r /2. s -3<r uniformly in u>. Therefore, the 
time-l-map is well defined on D r / i s _ ia , with 

(7) <& = Xp\ t=1 : A-/4,s-4cr — > Dr/2,s-3a, 

and 

1^-idKi^K^, 

on that domain for <f> = (U, V). The Jacobian of $ is 

since F is linear in /, hence Fj and V are independent of /. By the preceding 
estimates and Cauchy, 

ara v aa u ^ L ara v 

on the domain D r /% s _ 5r7 D Z) 7?r>s _5 cr . This proves the estimates for Finally, we 
observe that \U — id| \Fg\ ^ r\r implies that also $ : D nr , s ^^ a — > D2 nr , s -ia ■ 

5. New error term. To estimate P + as given in (4) we first consider the term 
{R,F}. Again, by Cauchy's estimate, 



\{R,n\r/2,.-3* < \RA \ F 0\ + \Re\ \Fl\ 



e e e 
< " + 



r aa" a ara u 1 
e 2 



< 
ara" 

The same holds for \{N ,F}\ r j 2 S _ 3(T . Together with (7) and r\ < | we get 
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/ {(l-^N + tR^FjoX^dt 
Jo 



r}r,s — 5er 

^\{(l-t)N + tR,F}\ r/2>s _ Aa <^. 
The other term in (4) is bounded by 

\(P-R)°®\r,r, s - 5 *<\P-R\2 V r, s -±* 

^ \P ~ Q\2r)r,s-ia + \Q ~ ^W2Trr,a-4ff 

< (t? 2 + if "e^) e. 

These two estimates together give the bound for | P + \ . 

6. Transforming the frequencies. Finally, we have to invert the map 

lo i ► uj + = lu + v(lo), V — Nl = [Ri] 

to put N + N back into a normal form N + . With assumption (b) and Cauchy's 
estimate we can assure that 

Mh/2 = \Nl\ h/2 <l<^ 

The implicit function theorem of Appendix A applies, and there exists a real analytic 
inverse map ip : Oh/ a —* Oh/2, UJ + >- » w , with the estimates 

lip - id! , h \Dip. - Id] < - 
r 

on O/j/4- Setting AT+ = (iV + N) o ip the proof of the Kam Step is completed. 



5 Iteration and Proof of Theorem A 

A. We are going to iterate the Kam Step infinitely often, choosing appropriate 
sequences for the parameters a, r\ and so on. To motivate our choices, let us start by 
fixing a geometric sequence for a, say, a + = a/2, where the plus sign indicates the 
corresponding parameter value for the next step. Let r + = rjr, and let us consider 
the weighted error terms 



ar + a'l 



Then we have 
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E + < — + ( V 2 + K n e- Ka )-. 

Suppose we can choose rj and K so that rj 2 — E and K n e~ Ka ^ E. Then 

E+< V - 1 E 2 =E K , k=|. 

That is, E+ ^ c K ~ x E K for some constant c determined by the Kam Step and 
depending only on n and r. Consequently 

cE+ ^c K E K , 

and this scheme converges exponentially fast for E < c _1 . 
It remains to discuss our assumptions 

(d) i] 2 = E 

(e) K n e- Ka ^ E 

as well as assumptions (a-c) of the Kam Step. There is no obstacle to take (d) as 
the definition of r/, as this implies E ■< r/ and thus (a) for E sufficiently small. The 
other three conditions amount to 

-■<fc<^. K n e- Ka ^E = 

They are combined into the sufficient condition 

(8) K v o v e- Ka <i Ea v = — •< - < — . 

ar a 2K V 

Now it suffices to set up geometric sequences for K and h such that, say, 

K+CT+ = 2Ka, K v + h+ ^ K v h. 

Then these inequalities hold inductively, provided they hold initially, with K o~q 
sufficiently large. In particular, after fixing a 0l K and E , we may set 

ho = ac Q E o-Q 

with a suitable constant Co . 

B. We are now ready to set up our parameter sequences. Let 

o~j hj 
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where (To = so/20, and Kq is chosen so large that the left hand side of (f) is smaller 
than its right hand side, and small enough so that Eq can be fixed to meet (a) 
and (d). In addition, Kq(To has to be so large that the left hand side of (f) decreases 
at least at an exponential rate n. Thus we need 1 •< Kqo-q. Subsequently, we fix 
ho as above. 
Next, let 

E j+1 = c*- 1 E 1 ;, r i+1 =tj J -r i) Vj=E jt 
where is still free and c given by the Kam Step. Finally, we define 

Sj+i = Sj - ha 3 

and the complex domains 

= {|/| < Tj } x {|90| < Sj }, 
Oj = {\lu - Q a \ < hj}. 

Note that Sj -> s/2 and r,- -> 0. - Now let H = N + P . 

Iterative Lemma Suppose Pq is real analytic on D x O with 

l p oL, S o,hc ^ e ° : = aE oro°o- 
Then for each j ^ there exists a normal form Nj and a real analytic transformation 

IP ' = J o . . . o : Dj x Oj -> D x 0„ 

in the group S such that H o P = iVj + Pj with 

Moreover, 

|W'o(5' J+1 -3 rJ )| < 

on -0^+! x Oj + i with the weight matrix W = diag(rQ~ Id, o-q - Id, /i^" Id). 

Proof. Letting 5F = id, there is nothing to do for j = 0. To proceed by 
induction, we have to check the assumptions of the Kam Step for each j ^ . But 
(a) is satisfied by the definition of r/j and the sufficiently small choice of Eq, and 
(b-c) hold by the definition of hj and Kj and the choice of their initial values. 
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We obtain a transformation 
5j : l)j , i x O j+ i -» />, x 3 - 
taking if,- = iVj + P, into Hj o jF,- = Nj + \ + Pj+i with 

\P j+ i\<e j E j +( V * + K?e- K ^)e j 

Since vT x E] = E* = <?- K E j+u we obtain ^ e,-+i by an appropriate choice 

of c as required. Thus, the transformation = J 3 o Jj = J o . . . o 3^- takes 7? 
into A/j + i + Pj+i with the proper estimate for Pj+i- 

The estimate of & requires a bit more, though elementary work. We observe 
that the estimates of the Kam Step and Cauchy imply 

- id) I , m(D^ - Id)^ 1 ! < max (-^-, ^i-) < 



yarjafrjhjj " r,/i, 

on -Dj+i x Oj+i, where D denotes the Jacobian with respect to I, 9 and u), and 
Wj = diag^Id, ajhd, hjhd). We then have 



\w {P +1 - = \w {? j o ?j - p)\ 

s? IWoDFW^l IWjpj - id) I < |Wj(3j- - id) I < 



provided we can bound the first factor in the second row on the domain Dj x Oj . 

But by induction we have — DJ'q o . . . o D5F,_i, with the Jacobians 

evaluated at different points, which we do not indicate. Since IWiWi+J ^5 1 for 
all i, we can use a telescoping argument and the inductive estimates for the 3^ to 
obtain 



< IWo-DJoWo" 1 ! IVFoV^r 1 ! x 



Tjhj 

3 



This is uniformly bounded and small, since ej/rjhj converges rapidly to zero. 
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C. We prove Theorem A by applying the Iterative Lemma to the hamiltonian 
H = N + P, letting P = P, r = r, s = s. We have h ■< adj < as" < h by 
construction and assumption, and 

\ P n\ro,s ji < \ p \ r ,s,h < ( < r f ars < e o = aE a r a^ 

by fixing the constant 7 in Theorem A sufficiently small. 

By the estimates of the Iterative Lemma the maps SP converge uniformly on 

P| Dj x Oj = T* x n a , = {0} x {|Q0| < s/2}, 

to a map £F consisting of a family of embeddings $ : T" x f2 Q — > D x T™ and a 
parameter transformation if : fl a — > f2 , which are real analytic on T ra and uniformly 
continuous on Q Q . Moreover, 

I Wot? - id) I < "' 



uniformly on x fi Q by the usual telescoping argument 

From the estimate \H o IP ' — Nj \ < ej on Dj x Oj we obtain 

\Wj{J(D&yS/Ho& - JVA/j)| < 



' 3 W 



on x Oj with Wj = diag(r ■ 1 Id, er- 1 Id). The symplectic nature of the map & 

~ — 1 

and the uniform estimate of WqDJ 3 W, above then imply 



\X H o9 j -D&-X N \ < — 



on T* x f2 a for all j, where Xj? is the hamiltonian vector field of N = {u),I). Going 
to the limit we obtain 



X H oJ= D<f>X N . 

Thus, <& is an embedding of the Kronecker torus (T n ,uj) as an invariant torus of 
the hamiltonian vector field Xu at the parameter tp(u>) . Moreover, this torus is 
Lagrangian, since 

$*v — lim (&)*v\ = lim w| = 
by the symplecticity of the $ J . 
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D. Let us now look at the w-derivatives of the . Since Ej converges to zero 
at an exponential rate, we have 

► for all to ^ 0. 

r h m 

Hence, all ^-derivatives of the 3^ converge uniformly on T* x f2 Q , and we could 
assign w-derivatives of any order to the limit map 3 r on the Cantor set fl a [25]. 
Without making this concept precise, however, we can at least conclude that J is 
Lipschitz continuous in u. Its Lipschitz norm is bounded by the limit of the bounds 
on the first ^-derivatives of the 3^ . On x Q a , the usual Cauchy estimate yields 

E. We finally look at the estimate of 3 r . So far it does not reflect the actual 
size e of the perturbation, since we fixed Eg and thus eo independently of e. But 
we observe that everything is still all right if in all the estimates for Pj , 3j and , 
the 6j are scaled down by the linear factor 

— = aE Q r o-Q. 

Scaling down our estimates of 2f by this factor we can finally extract our estimates 
of $ and tp as stated in Theorem A, since aa^ ■< ho. This finishes the proof. 



A Some Facts about Analytic Functions 

A. First we recall a variant of the Cauchy estimate, which is used over and over. 
Let D be an open domain in C™, let D r — {z : \z — D\ < r} be the neighbourhood 
of radius r around D, and let F be an analytic function on D r with bounded 
sup- norm \f\ r - Then 

I fa- 1 

I J z i I r— p p r 

for all < p < r and 1 j ^ n. This follows immediately from the Cauchy estimate 
for one complex variable. 

B. Next we give the estimate for the Fourier coefficients of an analytic func- 
tion v on T™ used in the proof of Lemma 1 . Recall that A s denotes the space of all 
functions on T™ bounded and analytic in the strip {6 : < s}. 
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Lemma A.l If v e A s , then v = J^k Vk^ k,e ^ with 
\v k \^\v\ s e-^ s , fceZ". 

Proof. The Fourier coefficients v k of v are given by 

(2lT) n J T n 

Since the integral of an analytic function over a closed contractible loop in any of 
the coordinate planes is zero, and since v is 27r-periodic in each argument also in the 
complex neighbourhood, the path of integration may be shifted into the complex, 
so that 

Vk = 79^ / <° - ^)e-' l{k - e - iv) d0 

for any constant real vector ip with \ip\ < s. Choosing ip = (s — cr)(ei, . . . , e n ) with 
< a < s and ej — sgn kj, 1 ^ j ^ n, we obtain 



\v k \ < 1-uL e" 



-\k\(s 



for all a > 0. Letting a — > the lemma follows, \°\ 

We can now also estimate very roughly the remainder, when we truncate the 
Fourier series of v at order K to obtain Tr-d — Y^,\k\<K w fc e ^ & '^ ■ 

Lemma A. 2 If v e A s and 1 Ko > 1, then 

\v - T K v\ s _ a ^ CK n e- K ° \v\ s , O^a^s, 

where the constant C only depends on n. 

Proof. With Lemma A.l, 

\v-T K v\ s _„< Me |fe|(s - ff) 

\k\>K 

<\< E e-l fe l CT <K£ 4nr " le "' CT ' 

\k\>K 1>K 

lr The condition Ko 1 is missing in the original version of this paper that appeared in print. I 
am thankful to San Vu Ngoc for pointing this out to me. 
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by summing first over all k with |fe| = I, whose number is bounded by 4™/™ . The 
last sum is then easily bounded by a constant times K n e~ Ka . [°] 

There are much more efficient ways to approximate a periodic function v by 
trigonometric polynomials. The above crude way amounts to multiplying the Fourier 
transform v of v with a discontinuous cut off function. Instead, one should multiply 
v with a smooth cut off function ipx- F° r instance, one could take tpxi^) = ''Pix/K), 
where ip is a fixed function, which is 1 on the ball |a;| ^ |, vanishes outside the ball 
^ 1, and between and 1 otherwise. Transforming back, 

(viPk) = v * i>K 

amounts to a convolution of v with a real analytic approximation of the identity 
i/jk, as K — > oo. Such smoothing operators have many interesting properties. For 
more details, see for example [29]. 

C. We finally formulate a version of the implicit function theorem for analytic 
maps, which we need to invert the frequency map during the Kam Step. Recall 
that Oh is an open complex neighbourhood of radius h of some subset f2 of M. n . 
In the following, |-| denotes the sup-norm for vectors and maps, and the induced 
operator-norm for Jacobians. 

Lemma A. 3 Suppose f is a real analytic map from Oh into C™. If 
\f - id\ Qh < S < h/ A, 
then f has a real analytic inverse ip on Oh/4 ■ Moreover, 

\<p-id\, j\Dtp-U\<5 
on this domain. 

Proof. Let rj = h/A. Let u, v be two points in 2 rj with f(u) = f(v). Then 

u - v = (u - f(u)) - (v - f(v)) , 

hence |u — v\ < 26 < 2n. It follows that the segment (1 — s)u + sv, < s < 1, is 
strictly contained in O^. Along this segment, 

9 = max\Df-I\ <5/n<l 

by Cauchy's inequality and so 

\u-v\ < \Df-I\\u-v\ <9\u-v\ 
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by the mean value theorem. It follows that u — v. Thus, / is one-to-one on the 
domain Oi-q- 

By elementary arguments from degree theory the image of O277 under / covers 
Orf, since |/ — id| < S. So / has a real analytic inverse <p on O^, which clearly 
satisfies \(p — id| < 5. Finally 

{Dip-IiKUDf^-lL 



~ 1 - \Df - I\ 2v ~ 1 - S/2 V ~ v 
by applying Cauchy to the domain Oi n ■ 



B Lipschitz Functions 

Let B c R™ be a closed set. We prove the basic fact - used in section 2 - 
that a Lipschitz continuous function u : B — > R can be extended to a Lipschitz 
continuous function U : R™ — * R without affecting its Lipschitz constant 

\u(x) — u(y)\ 

x,yGB p J/l 

where on R 71 we may take any norm | • | . That is, we have 

l7 l B =' u ' \ U \cp> = \ U \c,B > 

Incidentally, for this extension _B could be any point set. 
Indeed, £7 is given by 

{/(a;) = sup (u(z) - X\z - x\) , x e R n , 

where A = \u\ L B . By the triangle inequality, 

(u(z) — X \z — y\) (u(z) — X \z — x\) — X \x — y\ . 

Taking suprema over z on both sides we obtain U(y) ^ U(x) — X\x — y\, or equiva- 
lently U{x) — U(y) ^ A \x — y\. Interchanging x and y we obtain 

\U(x)-U(y)\^X\x-y\, 

whence l^l^ r« ^ M,c b- We leave it to the reader to check that U = u on B. 
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